Joshua 4.0: Packing, PRO, and Paraphrases
نویسندگان
چکیده
We present Joshua 4.0, the newest version of our open-source decoder for parsing-based statistical machine translation. The main contributions in this release are the introduction of a compact grammar representation based on packed tries, and the integration of our implementation of pairwise ranking optimization, J-PRO. We further present the extension of the Thrax SCFG grammar extractor to pivot-based extraction of syntactically informed sentential paraphrases.
منابع مشابه
External Plagiarism Detection based on Human Behaviors in Producing Paraphrases of Sentences in English and Persian Languages
With the advent of the internet and easy access to digital libraries, plagiarism has become a major issue. Applying search engines is one of the plagiarism detection techniques that converts plagiarism patterns to search queries. Generating suitable queries is the heart of this technique and existing methods suffer from lack of producing accurate queries, Precision and Speed of retrieved result...
متن کاملAnalysis of designed β-hairpin peptides: molecular conformation and packing in crystals.
The crystal structures of several designed peptide hairpins have been determined in order to establish features of molecular conformations and modes of aggregation in the crystals. Hairpin formation has been induced using a centrally positioned (D)Pro-Xxx segment (Xxx = (L)Pro, Aib, Ac6c, Ala; Aib = α-aminoisobutyric acid; Ac6c = 1-aminocyclohexane-1-carboxylic acid). Structures of the peptides...
متن کاملA Comparison of Optimization Methods for Multi-objective Constrained Bin Packing Problems
Despite the existence of e cient solution methods for bin packing problems, in practice these seldom occur in such a pure form but feature instead various considerations such as pairwise con icts or pro ts between items, or aiming for balanced loads amongst the bins. The Wedding Seating Problem is a combinatorial optimization problem incorporating elements of bin packing with con icts, bin pack...
متن کاملComputational Study of Packing a Collagen-Like
The lateral packing of a collagen-like molecule, CHJO(Gly-L-Pro-L-Pro),NHCH,, has been examined by energy minimization with the ECEPPN force jield. Two current packing models, the Smith collagen microjibril twisted equilateral pentagonal model and the quasi-hexagonal packing model, have been extensively investigated. In treating the Smith microjibril model, energy minimization was carried out o...
متن کاملExtracting Paraphrases from a Parallel Corpus
While paraphrasing is critical both for interpretation and generation of natural language, current systems use manual or semi-automatic methods to collect paraphrases. We present an unsupervised learning algorithm for identification of paraphrases from a corpus of multiple English translations of the same source text. Our approach yields phrasal and single word lexical paraphrases as well as sy...
متن کامل